Interactive Perception : From Scenes to Objects

نویسنده

  • Niklas Bergström
چکیده

This thesis builds on the observation that robots, like humans, do not have enough experience to handle all situations from the start. Therefore they need tools to cope with new situations, unknown scenes and unknown objects. In particular, this thesis addresses objects. How can a robot realize what objects are if it looks at a scene and has no knowledge about objects? How can it recover from situations where its hypotheses about what it sees are wrong? Even if it has built up experience in form of learned objects, there will be situations where it will be uncertain or mistaken, and will therefore still need the ability to correct errors. Much of our daily lives involves interactions with objects, and the same will be true robots existing among us. Apart from being able to identify individual objects, the robot will therefore need to manipulate them. Throughout the thesis, di erent aspects of how to deal with these questions is addressed. The focus is on the problem of a robot automatically partitioning a scene into its constituting objects. It is assumed that the robot does not know about specific objects, and is therefore considered inexperienced. Instead a method is proposed that generates object hypotheses given visual input, and then enables the robot to recover from erroneous hypotheses. This is done by the robot drawing from a human’s experience, as well as by enabling it to interact with the scene itself and monitoring if the observed changes are in line with its current beliefs about the scene’s structure. Furthermore, the task of object manipulation for unknown objects is explored. This is also used as a motivation why the scene partitioning problem is essential to solve. Finally aspects of monitoring the outcome of a manipulation is investigated by observing the evolution of flexible objects in both static and dynamic scenes. All methods that were developed for this thesis have been tested and evaluated on real robotic platforms. These evaluations show the importance of having a system capable of recovering from errors and that the robot can take advantage of human experience using just simple commands.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Consistency effects between objects in scenes.

How does context influence the perception of objects in scenes? Objects appear in a given setting with surrounding objects. Do objects in scenes exert contextual influences on each other? Do these influences interact with background consistency? In three experiments, we investigated the role of object-to-object context on object and scene perception. Objects (Experiments 1 and 3) and background...

متن کامل

Object detection in natural scenes: Independent effects of spatial and category-based attention

Humans are remarkably efficient in detecting highly familiar object categories in natural scenes, with evidence suggesting that such object detection can be performed in the (near) absence of attention. Here we systematically explored the influences of both spatial attention and category-based attention on the accuracy of object detection in natural scenes. Manipulating both types of attention ...

متن کامل

How does context influence the perception of objects

and scenes? Although recognition experiments typically investigate objects in isolation, objects in the world rarely appear without some context. Objects are always located spatially within a setting and usually appear with other objects. Does contextual information affect recognition? For example, a new acquaintance may be recognized more quickly in the presence of a mutual friend than alone. ...

متن کامل

Moving Objects Tracking Using Statistical Models

Object detection plays an important role in successfulness of a wide range of applications that involve images as input data. In this paper we have presented a new approach for background modeling by nonconsecutive frames differencing. Direction and velocity of moving objects have been extracted in order to get an appropriate sequence of frames to perform frame subtraction. Stationary parts of ...

متن کامل

Acquisition Viewpoints - Efficient and Versatile View - Dependent Modeling of Real - World Scenes

Three dimensional modeling is a severe bottleneck for computer graphics applications. Manual modeling is time consuming and, even so, the resulting models fail to capture the true complexity of real world scenes. Automated modeling based on acquiring color and depth data is a promising alternative. We propose an automated modeling approach based on sampling the scene sparsely from a dense set o...

متن کامل

Interactive Coding of Visual Spatial Frequency and Auditory Amplitude-Modulation Rate

Spatial frequency is a fundamental visual feature coded in primary visual cortex, relevant for perceiving textures, objects, hierarchical structures, and scenes, as well as for directing attention and eye movements. Temporal amplitude-modulation (AM) rate is a fundamental auditory feature coded in primary auditory cortex, relevant for perceiving auditory objects, scenes, and speech. Spatial fre...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012